Tuning a Grammar Correction System for Increased Precision
نویسندگان
چکیده
In this paper, we propose two enhancements to a statistical machine translation based approach to grammar correction for correcting all error categories. First, we propose tuning the SMT systems to optimize a metric more suited to the grammar correction task (F-β score) rather than the traditional BLEU metric used for tuning language translation tasks. Since the F-β score favours higher precision, tuning to this score can potentially improve precision. While the results do not indicate improvement due to tuning with the new metric, we believe this could be due to the small number of grammatical errors in the tuning corpus and further investigation is required to answer the question conclusively. We also explore the combination of custom-engineered grammar correction techniques, which are targeted to specific error categories, with the SMT based method. Our simple ensemble methods yield improvements in recall but decrease the precision. Tuning the custom-built techniques can help in increasing the overall accuracy also.
منابع مشابه
Transformation-Based Learning of Danish Grammar Correction
We describe a technique for using the Brill Tagger to learn to identify grammar errors. We have applied this technique to two types of Danish grammar errors: incorrect commas, and incorrect article-noun agreement. The system identi es comma errors with a precision of 91%, while agreement errors are identi ed with 95% precision, with many of the system errors resulting from de ciencies in the ta...
متن کاملDesign and implementation of Persian spelling detection and correction system based on Semantic
Persian Language has a special feature (grapheme, homophone, and multi-shape clinging characters) in electronic devices. Furthermore, design and implementation of NLP tools for Persian are more challenging than other languages (e.g. English or German). Spelling tools are used widely for editing user texts like emails and text in editors. Also developing Persian tools will provide Persian progr...
متن کاملIranian EFL High School Students’ Perceptions Regarding Written Grammar Feedback
This paper reports on a study thatinvestigated Iranian EFL high school students’ perceptions of written grammar feedback to specify their reasons for preferring comprehensive or selective feedback and choosing some feedback strategies. A questionnaire was administered to 100 EFL intermediate high school students who were selected based on their scores on a proficiency test. Moreover, semi-struc...
متن کاملPhrase-based Machine Translation is State-of-the-Art for Automatic Grammatical Error Correction
In this work, we study parameter tuning towards the M2 metric, the standard metric for automatic grammar error correction (GEC) tasks. After implementing M2 as a scorer in the Moses tuning framework, we investigate interactions of dense and sparse features, different optimizers, and tuning strategies for the CoNLL-2014 shared task. We notice erratic behavior when optimizing sparse feature weigh...
متن کاملGenieTutor: a computer assisted second-language learning system based on semantic and grammar correctness evaluations
This paper introduces a Dialog-Based Computer-Assisted secondLanguage Learning (DB-CALL) system using semantic and grammar correctness evaluations and the results of its experiment. While the system dialogues with English learners about a given topic, it automatically evaluates the grammar and content properness of their English utterances, then gives corrective feedback on grammar and semantic...
متن کامل